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Abstract 

We study the integration and approximation problems for monotone and convex 
bounded functions that depend on d variables, where d can be arbitrarily large. We 
consider the worst case error for algorithms that use finitely many function values. We 
prove that these problems suffer from the curse of dimensionality. That is, one needs 
exponentially many (in d) function values to achieve an error e. 
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1 Introduction 



Many multivariate problems suffer from the curse of dimensionality. A partial list of such 
problems can be found in e.g., [HI [7]. The phrase curse of dimensionality was coined by Bell- 
man already in 1957 and means that the complexity 1 of a d-variate problem is an exponential 
function in d. This is usually proved for multivariate problems defined on the unit balls of 
normed linear spaces. We stress that the curse of dimensionality may hold independently of 
the smoothness of functions and may hold even for analytic functions. 

The choice of the unit ball as the domain of a multivariate problem is not essential and 
can be slightly generalized. What is important and heavily used in the proof is that the 
domain of the d variate problem is balanced (/ G implies — / G Fd) and convex 
{fx-, /2 ^ Fd and t G [0, 1] imply that tf\ + (1 — t)fi G F d ). It is not clear if the curse of 
dimensionality may hold for domains Fd being not balanced or not convex. 

In this paper we study classes of monotone and convex <i-variate bounded functions. 
Such classes are obviously not balanced and the previous analysis to prove the curse of 
dimensionality does not apply. We study the integration problem and the approximation 
problem in the L p norm with p G [1, oo]. We consider the worst case setting and algorithms 
that use finitely many function values. In particular, we ask what is the minimal number of 
o?-variate function values that is needed to achieve an error e. 

It turns out that the approximation problem in the L p norm for both monotone and 
convex functions is no easier than the integration problem. This means that lower error 
bounds for integration also hold for approximation. Hence, it is enough to prove the curse 
of dimensionality for the integration problem. 

The integration problem for monotone functions has been studied by Papageorgiou [8], 
and for convex functions by Katscher, Novak and Petras [I]. They proved the optimal rate 
of convergence and provided lower and upper bounds on the nth minimal error. From these 
bounds we can conclude the lack of some tractability properties defined later, but cannot 
conclude whether the curse of dimensionality holds. 

In this paper we prove that for both monotone and convex functions, the curse of dimen- 
sionality holds for the integration problem and therefore also holds for the approximation 
problem in the L p norm. The proof relies on identifying "fooling" functions / - and / + 
which are both monotone or both convex, which share the same n function values used by 
an algorithm, and whose integrals differ as much as possible. Here "as much as possible" 
means that the error is at most e only if n is exponentially large in d. The fooling functions 

■""By complexity we mean the minimal cost of computing an e-approximation. The complexity is bounded 
from below by the information complexity which is defined as the minimal number of function values needed 
to compute an e-approximation. In this paper we prove that even the information complexity suffers from 
the curse of dimensionality. 
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for the monotone class take only values or 1 depending on the points used by an algo- 
rithm. The fooling functions for the convex class are f~=0 and / + is chosen such that it 
vanishes at n points used by an algorithm, and its integral is maximized. Using the results 
of Elekes p] and Dyer, Fiiredi and McDiarmid |2] on random volumes of cubes, we prove 
that the integral of / + is of order 1 for large d, if n is smaller than, say, (12/ll) d . 

Restricting the algorithms for the integration problem to use only function values is 
quite natural. However, for the approximation problem it would be also interesting to 
consider algorithms that use finitely many arbitrary linear functionals. We believe that 
the L p approximation problem still suffers from the curse of dimensionality for this general 
information, and pose this question as an open problem. The paper by Gilewicz, Konovalov 
and Leviatan [5] may be relevant in this case. This paper presents the order of convergence 
for the approximation problem for s-monotone functions (in one variable). 

We finally add a comment on the worst case setting used in this paper. Since integration 
for monotone and convex classes suffers from the curse of dimensionality in the worst case 
setting, it seems natural to switch to the randomized setting where algorithms can use 
function values at randomized sample points. Now we can use the classical Monte Carlo 
algorithm. Since all monotone and convex integrands are bounded by one, the error bound 
of Monte Carlo is n~ 1 ^ 2 , without any additional constant. Hence, e~ 2 function values at 
randomized sample points are enough to guarantee a randomized error e. This means that 
the integration problem for both monotone and convex functions is strongly polynomially 
tractable 2 in the randomized setting. The exponent 2 of e^ 1 is optimal since the optimal 
orders of convergence for randomized algorithms are n~ 1 l 2 ~ 1 l d for monotone functions, see 
[8], and rT l l 2 ~ 2 l d for convex functions, see [3]. Hence, for large d we cannot guarantee a 
randomized error e with e~ p function values with p < 2. This proves that the switch for 
the worst case setting to the randomized setting breaks the curse of dimensionality for the 
integration problem defined for monotone and convex functions. 

Not much seems to be known about the L p approximation problem in the randomized 
setting for monotone or convex functions. It is not clear if we still have the curse of dimen- 
sionality in the randomized setting. We pose this as another open problem. 

2 Integration 

We mainly study the integration problem, i.e., we want to approximate 




2 This means that ([3]) holds with q = 0. In this case we can choose C = 1 and p = 2. 
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for bounded functions / : [0, l] d — > [0, 1] that are monotone (more precisely, non-decreasing 
in each variable Xj if the other variables are fixed) or convex. Hence, we consider the classes 

pmon = y . [ Q) [ 0) i] I / is monotone} 

and 

F d con = {/:[0,l] d ->[0,l]| / is convex}. 

We approximate the integral INT^(/) by algorithms A n that use information about / given 
by n function values. Hence, A n has the form 

A n (f) = ( f> n (f(t 1 )J(t 2 ),...,f(t n )), (1) 

where n is a nonnegative integer, (p n : M" — > R is an arbitrary mapping, and the choice of 
arbitrary sample points tj G [0, l] d can be adaptive. That is, tj may depend on the already 
computed values f(ti), ffo), ■ ■ ■ , f(tj-i). For n = 0, the mapping A n is a constant real 
number. More details can be found in e.g., [3 El [TJ |9] . 

We define the nth minimal error of such approximations in the worst case setting as 

e^(F d )=inf sup|INT d (/)-A n (/)| for F d e {F/ on , F d con }- 

A n f£F d 

For n — 0, it is easy to see that the best algorithm is A = | for the two classes considered 
in this paper, and we obtain 

e int( F mon) = ^t(|T««) = I for all d G N. 

Hence, the integration problems are well scaled and it is enough to study the absolute 
error. The information complexity is the inverse function of e 1 ^ t (Fd) given by 

n int (F d , e) = min{n | e^{F d ) < e} for F d G {F d mon , ^ c ° n }- 

It is trivial that n int (F d mon , e) = n int (F d con , e) = for all e>\. 

3 Known and new results 

The integration problems for monotone and for convex functions were studied before, we 
refer to the paper by Papageorgiou j8] for monotone functions, and to the paper by Katscher, 
Novak and Petras [1] for convex functions. Here we mention some of the known results and 
indicate our new results concerning the curse of dimensionality. 
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For the class F d mon of monotone functions it was proved by Papageorgiou [S] that 

Hence, the optimal order of convergence is n~ l l d . More precisely, it is proved in [5] that 
there are some positive numbers c, C independent of n and d such that for all d, n £ N we 
have 

cd- 1 n- 1/d < e™\F d mon ) < Cdn- l/d . (2) 

It is interesting to note that the ratio between the upper and the lower bound is of the 
order d 2 , i.e., it is polynomial in d, not exponential as it is the case for many other spaces. 
The bound ([2D yields 



ids) 



< n mt (F d mon ,e) < 



Cd^ ! 



From this we conclude that polynomial tractability and even weak tractability do not hold. 
That is, it is not true that there are non-negative C, q,p such that for all d £ N and e £ (0, |) 
we have 

n int (F™ n ,e) <Cd q e~ p (polynomial tractability) , (3) 
as well as it is not true that 

In n int (F d mon ,e) . , , . 

hm = (weak tractability). 

e- 1 +d^oo £ 1 + d 

Nevertheless, the lower bound on n int (-F d mon , e) is useless for a fixed e > and large d, since 
for d > c/e we do not obtain a bound better than n int (FJ non , e) > 1. Thus, it is not clear 
whether the information complexity n wt (FJ oon , e) is exponential in d for a fixed e £ (0, |). 
In this paper we will prove that 

n int (F d mon , e) >2 d (l- 2e) for all d £ N, e £ (0, |). 

This means that n int (F d mon , e) is indeed exponential in d, that is the integration problem 
suffers from the curse of dimensionality. 

We now turn to the class F d on of convex functions. It was proved by Katscher, Novak 
and Petras [4] that 

e^(Fr) = Q(n- 2/d ). 
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Again, the optimal order of convergence is known, now it is rT 2 l d . More precisely, it was 
proved in [1] that there are some positive numbers q, C, with q being exponentially small 
in d whereas C is independent of d, such that we have for all n G N 



c d n- 2/d < e™\F d con ) <Cdn 



-2/d 



(4) 



The bound (TJJ yields 



<n int (F d con , £ ) < 



d/2' 



From this we conclude that polynomial tractability does not hold. The lower bound in is 
useless for a fixed e > and large d, and therefore it is not clear if we have weak tractability 
or the curse of dimensionality. In this paper we will prove that there exists Eq G (0, 1/4) 
such that 

1 flV 

To 



,e > 



for all d G N, £ G (0,e ]. 



2(d+l) 

Hence, the integration problem also suffers from the curse of dimensionality for convex 
functions. 



4 The class of monotone functions 

We consider integration for monotone functions. Assume that A n is an arbitrary (possibly 
adaptive) algorithm for the class _F d mon . For x = x 2 , ■ ■ ■ , G [0, consider the 
"fooling" function 




U Et=iXk<d/2, 

1 XYLi x k>d/2. 



Obviously, /* G F d mon and therefore the algorithm A n will use function values 

r(*i),r(*a),...,r(*») 

for some sample points i,- G [0, Since the algorithm A n can on/y use the computed 
function values, we obtain 

A n (f) = A n (f*) 
for all / G Fr n if /(**) = /*(4) for fc = 1, 2, . . . , n. 
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Take first the case n = 1. Suppose first that f*{t\) = 0, i.e., ^ J=1 ^ij < d/2 for 
*i — [*i,i> *i,2) • • • 5 *i,d] - Define /~ = and the function 



/ + o*o 



if x < t\ (in every coordinate), 

1 otherwise. 



Then / , / + e FJ 11011 and they yield the same information as /*, i.e., 

f-(ti) = f + (h) = f*(ti) = o. 

Using the standard proof technique it can be checked that 

d d 

m *« Ylvj = max ]T(1 - %) = T d . 

ye[o,i] d , E"=i %<^/2 y ye[o,i] d , Ej =1 %><f/2 

Then 

P d 

INT d (/+) = 1 - INT d (l - /+) = 1 - / dx = 1 - 

Jx<ti =1 

This implies that 

INT d (/ + ) - INT d (/-) > 1 - 2~ d . (5) 
The case with = 1 is similar. Now take / + = 1 and 



1 if x > ti, 
otherwise. 



Again / + and / are from _P d mon and they yield the same information as /*. We also 
obtain (jSJ). We estimate the error of A% on the whole class F d mon by 

sup |INT d (/)-A»(/)| > max(|INT d (/+)-A»(r)UlNT d (/-)-A»(r)|) 

/gF mon 

> \ (INT,(/ + ) - INT d (D|) > | (1 - 2- d ) . 
Since this holds for all algorithms, we conclude that 

e 1 {Fr\e)>\{l-2- d ). 
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The general case with n G N is similar. Assume that £ of the function values yield 
f*{tk) — while n — £ function values yield f*(tk) = 1- Without loss of generality, we may 
assume that 

ntj) = for j = 1,2,...,*, 

r(tj) = 1 for j = e + l,£ + 2,...,n. 



Define the two functions, 

f + (x) 

and 

/» = 



if x < ti or x < t 2 or . . . or x < t e , 

1 otherwise. 

1 if x > te+i or x > ti +2 or ... or x > t n , 
otherwise. 



Then /+, /" G F™ n with 

f + (tk) = r(t k ) = f*(t k ) for all fc = l,2,...,n. 
Furthermore, we have 



n— £ „ 

INT d (/")< J] / ldx< (n-£)2- rf . 

■ =1 Jx>t e+J 



Similarly it is easy to show that INT d (/ + ) > 1 - 2~ d ■ £, so that 

INT d (/ + ) - INT d (/~) > 1 - T d ■ n. 

Therefore the worst case error of A n is at least |(1 — 2~ d n). Since this holds for an arbitrary 
A n we also have 

e„(F d mon ) > \ (1 - 2-V) . 
This leads to the following theorem. 

Theorem 1. For each fixed e G (0, |), the information complexity is at least 

n int (F d mon , e) > 2 d (1 - 2e) for all d G N. 

Thus, the integration problem for monotone functions suffers from the curse of dimension- 
ality. 
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5 The class of convex functions 

We now consider integration for convex function and prove the curse of dimensionality. 

Theorem 2. There exists Eo G (0, ~) such that for each fixed e G (0,£o) the information 
complexity is at least 

WV)>^g)'(l-£) for all d eli. 

Thus, the integration problem of convex functions suffers from the curse of dimensionality. 

The idea of the proof is as follows. Assume again that we have an arbitrary (possibly 
adaptive) algorithm A n for the class F£° n . For the zero function f~=0 the algorithm A n 
uses function values at certain sample points xi,X2, ■ ■ ■ ,x n . This implies that A n uses the 
same sample points Xi, x%, ■ ■ ■ ,x n for any function / from -F d con with 

/(xi) = f(x 2 ) = ■ ■ ■ = f(x n ) = 0. 

In particular, let / + be the largest such function, 

f + {x) = sup{/(x) I f( Xj ) = 0, j = l,2,...,n, feF d con }. 

Clearly, /+ G F d con , f + ( Xj ) = for j = 1, 2, . . . , n, f(x) > for all x G [0, l] d , and /+ 
has the maximal value of the integral among such functions. The integral INTd(/ + ) is the 
volume of the subset under the graph of the function / + . This subset under the graph is the 
complement in [0, of the convex hull of the points (xi, 0), (x2, 0), . . . , (x n , 0) G [0, 
and [0, l] d x {1} c [0, l] d+1 . Denoting this convex hull by C, we obtain 

INT d (/ + ) = l-voW 1 (C). 

Since the algorithm A n computes the same result for the functions /~ and / + but 
INTd(/ _ ) = we conclude that A n has error at least 

i(l-vol d+1 (C)) 

on one of these functions. Theorem [2] now follows directly from the next theorem which gives 
an estimate of the volume of the set C by setting eq = to/ 2. 

Theorem 3. Let P be an n-point set in [0, l] d x {0}. Then the (d+ l)-dimensional volume 
of the convex hull C of P U ([0, l] d x {1}) is at most 

vol d+1 (C) < (1 - t ) + (d+ 1) nt 
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for some t G (0, 1) independent of d and n. 
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Proof. Let Q = [0, l} d and Q t = [0, l] d x {t} C for t G [0, 1]. Let P cQ be an n-point 
set and let C be the convex hull of P U Q\. We want to show that 



vol d+1 (C) < (1 - t ) + (d + 1) n t ^— 

Let Ct = C fl be the slice of C at height t. For a point z = (z\, z%, . . . , zj, Zd+\) G M. d+1 
let 2 = (^i, Z2, ■ ■ ■ , Zd) be its projection onto the first d coordinates. Similarly, for a set 
M C R d+1 , let M be the set of all points z with z G M. 
Since 



voW 1 (C)= / vol d (C t )dt= / vol d (C t )dt<(l-t )+ f\o\ d (C t )dt, 
Jo Jo Jo 

it is enough to prove that 



- /lOV 
vol d (C t ) < (d+l)n f —J for all tG[0,t ]. 

Caratheodory's theorem states that any point in the convex hull of a set M in R d is 
already contained in the convex hull of a subset of M consisting of at most d + 1 points. 
Hence, every point of P is contained in the convex hull of d + 1 vertices of Qo- It follows 
that it is enough to show that 

vol d (^) < n (H) (6) 

whenever P is an n-point set of such vertices of Qq. So we assume now that P is such a set. 
Let 

w t = ((1 + t)/2, (1 + t)/2, . . . , (1 + t)/2, t) G Q t . 

For each vertex v G P, let C Qo be the intersection of the ball with center |(wo + 
and radius |||wo — v \\ with Q - Observe that Co is the convex hull of P. By Elekes' result 
from [I], 

It follows that 

C = conv(P U Qi) C (J conv(P,; U Qi) 
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since each point in this convex hull lies on a segment between a point in some B v and a point 
in Q\. Since all sets conv(S„ U Qi) are congruent, the inequality §6§ immediately follows if 
we show that ^ 

vol d (A) < f^) for all t e [0,to], (7) 

where D t = conv(B v U Qi) fl Q t is the section of the convex hull at height t. We can now 
restrict ourselves to the case that v is a fixed vertex in P, say v — (0, 0, . . . , 0, 0). 

Let O be the origin in R d . Let E t C Q be the intersection of the ball with center ^w t 
and diameter with Q. Then D 4 C E t , so ([7]) is proved once we show 

a 



vo\ d (E t ) < (^j for all te [0,t \. 
To this end we follow the approach from [2]. Set 2s = |(1 + 1). Then 



a 

vo\ d (E t )=F( K Y,(X J -s) 2 <ds' 

3=1 



where Xi, X 2 , . . . , X d are independent uniformly distributed in [0, 1]. We now use Markov's 
inequality 

P( iyi > „, < mi, 

a 

which holds for all real random variables Y and all a > 0. We take a = 1 and 



Y = exp 

and conclude that vo\d(E t ) is smaller than 

Eexp ^(ds 2 -J2( x j ~ s) 2 )^ = {^exp(a(2sX - X 2 ))Y 

where X is uniformly distributed in [0, 1] and a > is arbitrary This implies 

vol d (E t ) < (inf g(s,a)) 

\ Q>0 / 

where 

g(s, a) — / exp(a(2sx — x 2 )) dx. 
Jo 
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By continuity and the proof in [2] we find a positive to, and for each t e [0, t Q ], we find some 
positive a such that 

. 10 
g(s,a) < — , 

where 2s = |(1 + 1). Now ([H]) follows and the proof is completed. □ 



6 L p approximation 

The L p approximation problem is defined by 

APP, : F d ^ L p ([0,l} d ) with APP d (/) = / 

for F d e {F d mon , F d con } and the standard L p ([0, l] d ) space. 

The algorithms A n are now given by (P) with n : W 1 -»■ L p ([0, l] d ). The nth minimal 
error for the L p approximation problem in the worst case setting is defined by 

e^(F d ) = inf sup ||APP d (/) - A n (f)\\ Lp{[0tl]d) . 
An feF d 

For n = 0, the initial error is again |. The information complexity is now 

n^(F d , e) = mm{n \ e™(F d ) < e}. 

Note that lower bounds for integration also hold for L p approximation. Indeed, take an 
arbitrary algorithm A n for the L p approximation problem, and let 



AT(f)= / A n (f)(x)dx. 

J [0,1]* 

Then A 1 ^ approximates the integral of / and we have 

INT d (/) - A^if) = [ (f(x) - A n (f)(x)) dx 

This yields 

|lNT d (/) - AT(f)\ < [ \f(x) - A n (f)(x)\ dx < ( I \f(x) - A n (f)(x)\ p dx) 

J[0,l] d \J[0,l] d J 

Since this holds for all algorithms A n , we have 

e^(F d )<er(F d ) and n int (F d , e) < n app (F d , e), 

as claimed. In particular, the curse of dimensionality also holds for the L p approximation 
problem for both classes F d mon and F d on . 



[o,i] d 

\ Vp 
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